Smart Paper Notes

Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making

Andrew Butcher , Michael Bradley Johanson , Elnaz Davoodi , Dylan J. A. Brenneis , Leslie Acker , Adam S. R. Parker , Adam White , Joseph Modayil , Patrick M. Pilarski

Categories: Artificial Intelligence | Machine Learning

2022-01-11

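In this paper, we contribute a multi-faceted study into Pavlovian signalling: a process by which learned, temporally extended predictions made by one agent inform decision-making by another agent. Signalling is intimately connected to time and timing. In service of generating and receiving signals, humans and other animals are known to represent time, determine the time since past events, predict the time until a future stimulus, and both recognize and generate patterns that unfold in time. We investigate how different temporal processes impact coordination and signalling between learning agents by introducing a partially observable decision-making domain we call the Frost Hollow. In this domain, a prediction learning agent and a reinforcement learning agent are coupled into a two-part decision-making system that works to acquire sparse reward while avoiding time-conditional hazards. We evaluate two domain variations: machine agents interacting in a seven-state linear walk, and human-machine interaction in a virtual reality environment. Our results showcase the speed of learning for Pavlovian signalling, the impact that different temporal representations do (and do not) have on agent-agent coordination, and how temporal aliasing impacts agent-agent and human-agent interactions differently. As a main contribution, we establish Pavlovian signalling as a natural bridge between fixed signalling paradigms and fully adaptive communication learning between two agents. We further show how to computationally build this adaptive signalling process out of a fixed signalling process, characterized by fast continual prediction learning and minimal constraints on the nature of the agent receiving signals. Our results therefore suggest an actionable, constructivist path towards communication learning between reinforcement learning agents.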

translated by Google Translate
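The idea in the abstract above, a learned temporal prediction that is turned into a signal for another agent, can be illustrated with a small sketch. This is not the authors' Frost Hollow implementation: the deterministic left-to-right walk, the cumulant placed at the last state, the step size, the discount, and the threshold-based `signal` function are all assumptions chosen for illustration. It uses tabular TD(0) to learn a general value function over a seven-state chain, then maps the prediction to a binary token.

```python
# Illustrative sketch only: every constant and name here is an assumption,
# not taken from the paper.
GAMMA = 0.9      # discount of the learned prediction (assumed)
ALPHA = 0.5      # TD step size (assumed)
N_STATES = 7     # states 0..6; entering state 6 ends the episode


def learn_predictions(episodes=200):
    """Tabular TD(0) estimate of the discounted cumulant along the walk."""
    v = [0.0] * N_STATES
    for _ in range(episodes):
        s = 0
        while s < N_STATES - 1:
            s_next = s + 1
            terminal = s_next == N_STATES - 1
            # Cumulant is 1 only on the transition into the last state.
            cumulant = 1.0 if terminal else 0.0
            bootstrap = 0.0 if terminal else GAMMA * v[s_next]
            v[s] += ALPHA * (cumulant + bootstrap - v[s])
            s = s_next
    return v


def signal(v, s, threshold=0.8):
    """Fixed mapping from a prediction to a binary token (assumed rule)."""
    return v[s] >= threshold


predictions = learn_predictions()
signalling_states = [s for s in range(N_STATES) if signal(predictions, s)]
```

In this sketch the prediction rises as the walk approaches the last state, so the token switches on a few steps before the event; a receiving agent would only ever see the token, not the underlying value.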

Assessing Human Interaction in Virtual Reality With Continually Learning Prediction Agents Based on Reinforcement Learning Algorithms: A Pilot Study

Dylan J. A. Brenneis , Adam S. Parker , Michael Bradley Johanson , Andrew Butcher , Elnaz Davoodi , Leslie Acker , Matthew M. Botvinick , Joseph Modayil , Adam White , Patrick M. Pilarski

Categories: Artificial Intelligence

2021-12-14

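Artificial intelligence systems increasingly involve continual learning to enable flexibility in general situations that are not encountered during system training. Human interaction with autonomous systems is broadly studied, but research has so far under-explored interactions that occur while the system is actively learning and can noticeably change its behaviour within minutes. In this pilot study, we investigate how the interaction between a human and a continually learning prediction agent develops as the agent develops competency. Additionally, we compare two different agent architectures to assess how representational choices in agent design affect the human-agent interaction. We develop a virtual reality environment and a time-based prediction task wherein learned predictions from a reinforcement learning (RL) algorithm augment human predictions. We assess how a participant's performance and behaviour in this task differ across agent types, using both quantitative and qualitative analyses. Our findings suggest that human trust in the system may be influenced by early interactions with the agent, and that trust in turn affects strategic behaviour, but limitations of the pilot study rule out any conclusive statement. We identify trust as a key feature of interaction to focus on when considering RL-based technologies, and make several recommendations for modifying this study in preparation for a larger-scale investigation. A video summary of this paper can be found at https://youtu.be/ovyjdnbqtwq.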


Meta-Learning for Color-to-Infrared Cross-Modal Style Transfer

Evelyn A. Stump , Francesco Luzi , Leslie M. Collins , Jordan M. Malof

Categories: Computer Vision

2022-12-24

Recent object detection models for infrared (IR) imagery are based upon deep neural networks (DNNs) and require large amounts of labeled training imagery. However, publicly-available datasets that can be used for such training are limited in their size and diversity. To address this problem, we explore cross-modal style transfer (CMST) to leverage large and diverse color imagery datasets so that they can be used to train DNN-based IR image based object detectors. We evaluate six contemporary stylization methods on four publicly-available IR datasets - the first comparison of its kind - and find that CMST is highly effective for DNN-based detectors. Surprisingly, we find that existing data-driven methods are outperformed by a simple grayscale stylization (an average of the color channels). Our analysis reveals that existing data-driven methods are either too simplistic or introduce significant artifacts into the imagery. To overcome these limitations, we propose meta-learning style transfer (MLST), which learns a stylization by composing and tuning well-behaved analytic functions. We find that MLST leads to more complex stylizations without introducing significant image artifacts and achieves the best overall detector performance on our benchmark datasets.
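The grayscale baseline mentioned in the abstract above is described there as an average of the color channels. A minimal sketch of that operation follows, assuming an image stored as rows of `(r, g, b)` tuples; the function name `grayscale_stylize` and the data layout are hypothetical and are not taken from the paper.

```python
def grayscale_stylize(image):
    """Replace each (r, g, b) pixel with the unweighted mean of its channels.

    `image` is assumed to be a list of rows, each row a list of RGB tuples.
    """
    return [[sum(pixel) / len(pixel) for pixel in row] for row in image]


# Tiny 2x2 example image (made-up values).
image = [
    [(255, 0, 0), (0, 255, 0)],
    [(30, 60, 90), (10, 10, 10)],
]
gray = grayscale_stylize(image)
```

Note that this is an unweighted mean, as the abstract states, rather than a luminance-weighted conversion.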


evoML Yellow Paper: Evolutionary AI and Optimisation Studio

Lingbo Li , Leslie Kanthan , Michail Basios , Fan Wu , Manal Adham , Vitali Avagyan , Alexis Butler , Paul Brookes , Rafail Giavrimis , Buhong Liu

Categories: Artificial Intelligence

2022-12-20

Machine learning model development and optimisation can be a rather cumbersome and resource-intensive process. Custom models are often more difficult to build and deploy, and they require infrastructure and expertise which are often costly to acquire and maintain. Machine learning product development lifecycle must take into account the need to navigate the difficulties of developing and deploying machine learning models. evoML is an AI-powered tool that provides automated functionalities in machine learning model development, optimisation, and model code optimisation. Core functionalities of evoML include data cleaning, exploratory analysis, feature analysis and generation, model optimisation, model evaluation, model code optimisation, and model deployment. Additionally, a key feature of evoML is that it embeds code and model optimisation into the model development process, and includes multi-objective optimisation capabilities.


Task-Directed Exploration in Continuous POMDPs for Robotic Manipulation of Articulated Objects

Aidan Curtis , Leslie Kaelbling , Siddarth Jain

Categories: Robotics

2022-12-08

Representing and reasoning about uncertainty is crucial for autonomous agents acting in partially observable environments with noisy sensors. Partially observable Markov decision processes (POMDPs) serve as a general framework for representing problems in which uncertainty is an important factor. Online sample-based POMDP methods have emerged as efficient approaches to solving large POMDPs and have been shown to extend to continuous domains. However, these solutions struggle to find long-horizon plans in problems with significant uncertainty. Exploration heuristics can help guide planning, but many real-world settings contain significant task-irrelevant uncertainty that might distract from the task objective. In this paper, we propose STRUG, an online POMDP solver capable of handling domains that require long-horizon planning with significant task-relevant and task-irrelevant uncertainty. We demonstrate our solution on several temporally extended versions of toy POMDP problems as well as robotic manipulation of articulated objects using a neural perception frontend to construct a distribution of possible models. Our results show that STRUG outperforms the current sample-based online POMDP solvers on several tasks.


Visibility-Aware Navigation Among Movable Obstacles

Jose Muguira-Iturralde , Aidan Curtis , Yilun Du , Leslie Pack Kaelbling , Tomás Lozano-Pérez

Categories: Robotics

2022-12-06

In this paper, we examine the problem of visibility-aware robot navigation among movable obstacles (VANAMO). A variant of the well-known NAMO robotic planning problem, VANAMO puts additional visibility constraints on robot motion and object movability. This new problem formulation lifts the restrictive assumption that the map is fully visible and the object positions are fully known. We provide a formal definition of the VANAMO problem and propose the Look and Manipulate Backchaining (LaMB) algorithm for solving such problems. LaMB has a simple vision-based API that makes it more easily transferable to real-world robot applications and scales to the large 3D environments. To evaluate LaMB, we construct a set of tasks that illustrate the complex interplay between visibility and object movability that can arise in mobile base manipulation problems in unknown environments. We show that LaMB outperforms NAMO and visibility-aware motion planning approaches as well as simple combinations of them on complex manipulation problems with partial observability.


Problem Behaviors Recognition in Videos using Language-Assisted Deep Learning Model for Children with Autism

Andong Deng , Taojiannan Yang , Chen Chen , Qian Chen , Leslie Neely , Sakiko Oyama

Categories: Computer Vision

2022-11-17

Correctly recognizing the behaviors of children with Autism Spectrum Disorder (ASD) is of vital importance for the diagnosis of Autism and timely early intervention. However, the observation and recording during the treatment from the parents of autistic children may not be accurate and objective. In such cases, automatic recognition systems based on computer vision and machine learning (in particular deep learning) technology can alleviate this issue to a large extent. Existing human action recognition models can now achieve persuasive performance on challenging activity datasets, e.g. daily activity, and sports activity. However, problem behaviors in children with ASD are very different from these general activities, and recognizing these problem behaviors via computer vision is less studied. In this paper, we first evaluate a strong baseline for action recognition, i.e. Video Swin Transformer, on two autism behaviors datasets (SSBD and ESBD) and show that it can achieve high accuracy and outperform the previous methods by a large margin, demonstrating the feasibility of vision-based problem behaviors recognition. Moreover, we propose language-assisted training to further enhance the action recognition performance. Specifically, we develop a two-branch multimodal deep learning framework by incorporating the "freely available" language description for each type of problem behavior. Experimental results demonstrate that incorporating additional language supervision can bring an obvious performance boost for the autism problem behaviors recognition task as compared to using the video information only (i.e. 3.49% improvement on ESBD and 1.46% on SSBD).


Meta-simulation for the Automated Design of Synthetic Overhead Imagery

Handi Yu , Leslie M. Collins , Jordan M. Malof

Categories: Computer Vision

2022-09-19

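The use of synthetic (or simulated) data for training machine learning models has grown rapidly in recent years. Synthetic data can often be generated much faster and more cheaply than its real-world counterpart. One challenge of using synthetic imagery, however, is scene design: for example, the choice of content and its features and spatial arrangement. To be effective, this design must not only be realistic but also appropriate for the target domain, which (by assumption) is unlabeled. In this work, we propose an approach to automatically choose the design of synthetic imagery based upon unlabeled real-world imagery. Our approach, termed Neural-Adjoint Meta-Simulation (NAMS), builds upon the seminal meta-simulation approaches. In contrast to the current state-of-the-art methods, our approach can be pre-trained once offline and then provides fast design inference for new target imagery. Using both synthetic and real-world problems, we show that NAMS infers synthetic designs that match both in-domain and out-of-domain target imagery, and that training segmentation models with NAMS-designed imagery yields superior results compared to naive randomized designs and state-of-the-art meta-simulation methods.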


Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance

Anna Gottardi , Osman Ipek , Giuseppe Castellucci , Shui Hu , Lavina Vaz , Yao Lu , Anju Khatri , Anjali Chadha , Desheng Zhang , Sattvik Sahai

Categories: Natural Language Processing | Artificial Intelligence

2022-09-13

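Since its inception in 2016, the Alexa Prize program has enabled hundreds of university students to explore and compete to develop conversational agents through the SocialBot Grand Challenge. The goal of the challenge is to build agents capable of conversing coherently and engagingly with humans on popular topics for 20 minutes, while achieving an average rating of at least 4.0/5.0. However, as conversational agents attempt to assist users with increasingly complex tasks, new conversational AI techniques and evaluation platforms are needed. The Alexa Prize TaskBot Challenge, established in 2021, builds on the success of the SocialBot Challenge by introducing the requirement of interactively assisting humans with real-world cooking and do-it-yourself tasks, while making use of both voice and visual modalities. This challenge requires the TaskBots to identify and understand the user's need, identify and integrate task and domain knowledge into the interaction, and develop new ways of engaging the user without distracting them from the task at hand, among other challenges. This paper provides an overview of the TaskBot Challenge, describes the infrastructure support provided to the teams with the CoBot Toolkit, and summarizes the approaches the participating teams took to overcome the research challenges. Finally, it analyzes the performance of the competing TaskBots during the first year of the competition.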


Learning Operators with Ignore Effects for Bilevel Planning in Continuous Domains

Nishanth Kumar , Willie McClinton , Rohan Chitnis , Tom Silver , Tomás Lozano-Pérez , Leslie Pack Kaelbling

Categories: Artificial Intelligence | Machine Learning | Robotics

2022-08-16

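Bilevel planning, in which a high-level search over an abstraction of an environment is used to guide low-level decision-making, is an effective approach to solving long-horizon tasks in continuous state and action spaces. Recent work has shown how to enable such bilevel planning by learning action abstractions in the form of symbolic operators and neural samplers, given symbolic predicates and demonstrations that achieve known goals. In this work, we show that existing approaches fall short in environments where actions tend to cause a large number of predicates to change. To address this issue, we propose to learn operators with ignore effects. The key idea motivating our approach is that modeling every observed change in the predicates is unnecessary; the only changes that need to be modeled are those that are necessary for high-level search to achieve the specified goal. Experimentally, we show that our approach is able to learn operators with ignore effects across six hybrid robotics domains that enable an agent to solve novel variations of a task, with different initial states, goals, and numbers of objects, significantly more efficiently than several baselines.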
